REMARKS 

In view of the above amendments and the following remarks, reconsideration of 
the rejection and further examination are respectfully requested. 

The specification and abstract have been reviewed and revised to improve their 
English grammar and U.S. form. The amendments to the specification and abstract have 
been incorporated into a substitute specification and abstract. Attached are two versions 
of the substitute specification, a marked-up version showing the revisions, as well as a 
clean version. No new matter has been added. 

Original claims 4 and 8 have been cancelled without prejudice or disclaimer to the
subject matter contained therein. Please note that amended claims 9-11 now depend from
amended claim 1, rather than cancelled claim 8.

Claims 1-16 were rejected under 35 U.S.C. § 102(e) as being anticipated by 
Kashiwagi et al. (US 6,393,574). The rejection regarding claims 4 and 8 is considered 
moot based on their above-mentioned cancellation. Further, this rejection is believed 
clearly inapplicable to amended claims 1-3, 5-7 and 9-16 for the following reasons. 

Amended Claims 1-3, 5-7, and 9-16 are Patentable Over the Prior Art of Record 

Amended independent claim 1 recites a recording apparatus for recording, onto a 
recording medium, a compressed stream obtained by compressively encoding a broadcast 
audio/video signal, the recording apparatus including, in part: (1) an encoder operable to 
(a) receive the broadcast audio/video signal, (b) generate the compressed stream having a 
main unit from the received broadcast audio/video signal, (c) divide the compressed 
stream into a plurality of sub-units, and (d) form, according to an instruction, the main 
unit from the plurality of sub-units; (2) a recorder operable to record a compressed stream 
onto the recording medium; and (3) a system controller operable to (a) monitor a data 
size of the sub-units, and (b) provide, based on the data size, the instruction according to 
which the encoder forms the main unit from the sub-units, wherein when the size of the
sub-units exceeds a predetermined threshold the system controller provides the
instructions to form the main unit. Kashiwagi fails to disclose or suggest monitoring a
data size of the sub-units, wherein when the data size of the sub-units exceeds a
predetermined threshold a system controller provides an instruction to an encoder to form
a main unit from sub-units, as recited in independent claim 1.

In contrast, Kashiwagi teaches a method and apparatus for reproducing a bit 
stream. In particular, Kashiwagi teaches recording a multimedia system stream (MSS) 
onto a recording medium, wherein the MSS comprises "i (where i is a natural number) 
system streams SS, each of which is referred to as a 'video object' (VOB). Each video
object . . . comprises at least one video data block interleaved with up to a maximum 8
audio data blocks and up to a maximum 32 sub-picture data blocks." (see col. 22, lines
11-18). Further, Kashiwagi teaches that the video data blocks, the audio data blocks, and
the sub-picture blocks are "video object units" (VOBUs). Based on this configuration,
each VOB (main unit) includes a limited number of VOBUs (sub-units) (i.e., at least one
video data block, a maximum of 8 audio data blocks, and up to a maximum of 32
sub-picture data blocks). Thus, the following differences between the present invention, as
recited in independent claim 1, and the Kashiwagi reference become evident.

Kashiwagi teaches a multimedia system stream having i number of VOBs (main
units), wherein each VOB includes a specific limited number of VOBUs (sub-units).
However, Kashiwagi does not disclose or suggest monitoring a data size of the sub-units
and providing, based on the data size, an instruction according to which an encoder forms
a main unit from sub-units when the data size of the sub-units exceeds a predetermined
threshold. Specifically, Kashiwagi does not teach the above-mentioned limitations of
independent claim 1 because forming a VOB (main unit) according to a predetermined
limited number of VOBUs (sub-units) is not the same as monitoring a data size of the
sub-units and, according to the data size of the sub-units, providing an instruction to form
a main unit.

Moreover, amended independent claim 16 recites a recording method, which
includes monitoring a data size of the sub-units and providing, based on the data size, an
instruction for forming a main unit from the sub-units. The above-mentioned method
steps operate in the same manner as the recording apparatus recited in amended
independent claim 1. Accordingly, the distinguishing limitations of independent claim 16
are similar to the distinguishing limitations of independent claim 1.

In view of the above, it is respectfully submitted that the Kashiwagi reference
does not anticipate the invention as recited in amended independent claims 1 and 16.
Furthermore, Kashiwagi does not suggest the above-discussed limitations of independent
claims 1 and 16. Therefore, it would not have been obvious for one of ordinary skill in
the art to modify the Kashiwagi reference so as to obtain the invention of amended
independent claims 1 and 16. Accordingly, it is respectfully submitted that amended
claims 1 and 16 and the claims that depend therefrom are clearly allowable over
Kashiwagi.

In view of the above amendments and remarks, it is submitted that the present
application is now in condition for allowance, and an early notification thereof is earnestly
requested. The Examiner is invited to contact the undersigned by telephone to resolve
any remaining issues.



Respectfully submitted, 
Hideki FUKUDA et al. 




Nils E. Pedersen 
Registration No. 33,145 
Attorney for Applicants 



NEP/ALD/nrj 

Washington, D.C. 20006-1021 
Telephone (202) 721-8200 
Facsimile (202) 721-8250 
April 9, 2007

RECORDING APPARATUS AND METHOD



BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a recording method and
apparatus and, more particularly, to a recording method and
apparatus for recording audio/video signals on recording media.

2. Description of the Related Art

Specifications for recording audio/video signals on a DVD
(Digital Versatile Disk), "DVD Specifications for Read-Only Disc
Part 3 VIDEO SPECIFICATIONS Version 1.0", were issued in August 1996,
and then Version 1.1 was issued in December 1997 (hereinafter
abbreviated as "DVD video specifications"). The DVD video
specifications define that video signals are compressively coded
according to an MPEG coding method of ISO/IEC 13818-2 (MPEG2 video)
or ISO/IEC 11172-2 (MPEG1 video).

The MPEG coding method is a coding method by which, as a
picture data coding process, an intra-frame coding process of coding
picture data utilizing intra-frame correlation of pixel values,
and an inter-frame coding process of coding picture data utilizing
inter-frame correlation of pixel values are adaptively switched
and carried out. According to this MPEG coding method, the coding
of picture data is performed with coding data corresponding to
plural sequential frames being taken as a unit. Such a unit that
is composed of plural sequential frames is called a Group of Pictures
(GOP). More specifically, in this MPEG coding method, the
intra-frame coding process is carried out for picture data
corresponding to at least one of the plural frames constituting
this GOP, and the inter-frame coding process is carried out for
picture data corresponding to the remaining frames.

The inter-frame coding process has two kinds of processes:
a forward prediction inter-frame coding process and a bidirectional
prediction inter-frame coding process. A frame to which the
forward prediction inter-frame coding is performed is referred
to as a P frame. A frame to which the bidirectional prediction
inter-frame coding is performed is referred to as a B frame. Picture
data of a P frame is predictively coded with reference to picture
data of a frame (reference frame) that is situated forward of the
P frame. Picture data of a B frame is predictively coded with
reference to picture data of two frames (reference frames) which
are situated in proximity and forward and backward of the B frame.
Usually, in the coding process for a P frame, an I frame that is in
proximity of the P frame is used as the reference frame. In the
coding process for a B frame, an I frame and a P frame, or two P frames,
which are in proximity of the B frame, are used as reference frames.

Figure 6 is a diagram for explaining a structure of a GOP,
and this figure shows plural sequential frames F(k-5)-F(k+12)
alongside coded data D(k-5)-D(k+12) corresponding to the
respective frames. Here, k denotes an arbitrary integer. Figure
6 shows an example in which one GOP is constituted by 12 frames:
B frame F(k-2) to P frame F(k+9). For example, for P frame F(k+3),
the inter-frame prediction coding process is carried out with I
frame F(k) as the reference frame. For P frame F(k+6), the
inter-frame prediction coding process is carried out with P frame
F(k+3) as the reference frame. Further, for B frames F(k+1) and
F(k+2), the inter-frame prediction coding process is carried out
with I frame F(k) and P frame F(k+3) as the reference frames.

For coded data corresponding to the respective frames
obtained by the above-mentioned coding process, a process for
changing the arrangement of the coded data from an arrangement
in which the respective frame pictures are arranged in the order
of display to an arrangement in which the respective frames are
arranged in the order of decoding (arrangement changing process)
is carried out. This is performed to reduce the amount of memory
that is used in the decoding process. For example, when the
arrangement changing process is carried out for coded data
corresponding to the GOP, as the coded data which is obtained
by the changing process, coded data D(k) of I frame F(k) is placed
at the top of the coded data corresponding to the GOP, and the
coded data D(k) is successively followed by coded data D(k-2) of
B frame F(k-2), coded data D(k-1) of B frame F(k-1), and coded
data D(k+3) of P frame F(k+3), as shown in figure 6. Then, the
coded data corresponding to the GOP is recorded on a recording
medium in the order that is obtained by the arrangement changing
process.
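
The arrangement changing process described above can be illustrated with a minimal Python sketch. It assumes, as the Figure 6 example suggests, that each I or P frame is moved ahead of the B frames that immediately precede it in display order. The function name, the tuple representation, and the frame types of F(k+4), F(k+5), F(k+7) and F(k+8) are assumptions made for illustration and are not taken from the specification.

```python
def display_to_decoding_order(frames):
    """Reorder (offset, type) pairs from display order to decoding order.

    Assumption: each I or P frame is placed ahead of the B frames that
    immediately precede it in display order.
    """
    decoding_order = []
    pending_b = []  # B frames waiting for their backward reference frame
    for frame in frames:
        _, frame_type = frame
        if frame_type == "B":
            pending_b.append(frame)
        else:
            decoding_order.append(frame)      # reference frame goes first
            decoding_order.extend(pending_b)  # then the B frames that need it
            pending_b = []
    decoding_order.extend(pending_b)  # trailing B frames, if any
    return decoding_order


# One GOP as in Figure 6: F(k-2) to F(k+9), written as offsets relative to k.
# The B/I/P pattern is an assumption consistent with the frames named in the text.
gop = list(zip(range(-2, 10), "BBIBBPBBPBBP"))
coded = display_to_decoding_order(gop)
print([offset for offset, _ in coded])
```

With this input the sequence starts with offsets 0, -2, -1, 3, which corresponds to D(k), D(k-2), D(k-1), D(k+3) in the description.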

As the MPEG coding method, there is a fixed-rate coding
method by which video signals are compressively coded at a fixed
data rate, and a variable-rate coding method by which video signals
are compressively coded at variable data rates depending on various
scenes of the video signals. Usually, video scenes having greater
difficulty in compressive coding and video scenes having smaller
difficulty are mixed in the video signal. Examples of the scenes
which have greater difficulty in coding are scenes including many
complicated patterns, scenes including many patterns moving
in a complicated way, scenes having large amounts
of motion, and the like. On the other hand, examples of the scenes
having smaller difficulty in coding are scenes having many simple
patterns, scenes having small amounts of motion, and the like.

The variable-rate coding method is a coding method that is 
characterized in that a large amount of information is allocated 
to scenes having greater difficulty in coding, thereby reducing 
coding distortion, while allocating a reduced amount of information 
to scenes having smaller difficulty in coding. According to this 
coding method, bit distribution can be performed efficiently and 
the average coding rate is reduced while keeping a higher image 
quality for all videos. When data is recorded on a storage medium 
such as a DVD, the recording of data is performed intermittently, 
thereby enabling the data which has been compressively coded by 
the variable-rate coding method to be handled. According to the 
DVD video specifications, the variable-rate coding method is often 
adopted to realize recording of video signals of a higher image
quality.
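
The principle of variable-rate coding can be illustrated with a small Python sketch. It assumes that each scene has already been given a numeric difficulty score and that bits are shared in direct proportion to that score; the specification states neither, so the function name, the scoring, and the proportional rule are hypothetical.

```python
def allocate_bits(difficulties, total_bits):
    """Split total_bits across scenes in proportion to coding difficulty.

    Assumption: difficulty is a non-negative integer score per scene.
    """
    total_difficulty = sum(difficulties)
    if total_difficulty <= 0:
        raise ValueError("at least one scene must have positive difficulty")
    # Integer division keeps the sum within the budget.
    return [total_bits * d // total_difficulty for d in difficulties]


# A hard scene between two easy ones receives the larger share.
budget = allocate_bits([1, 3, 1], 5_000_000)
print(budget)
```

A fixed-rate coder would instead give every scene the same share regardless of difficulty.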

A data structure according to the DVD video specifications
will be described with reference to figures 4(a) to 4(h). Figures
4(a) to 4(h) are diagrams illustrating the data structure according to the
DVD video specifications. Figure 4(a) shows a structure of data
recorded on a disk, which is, according to the DVD video
specifications, constituted by video manager information (VMG 40a)
and a video title set as an aggregate of video titles (VTSs). Here,
the video titles are referred to as VTS(1) 40a1, VTS(2) 40a2, VTS(3)
40a3, ..., respectively. The VMG 40a is a table indicating contents
of the video title set, on which control data that is required
for playback of the VTSs is recorded.

Figure 4(b) shows the structure of the VTS(1) 40a1. The
VTS(1) 40a1 is constituted by video title set information (VTSI
40b1) and a video object set (VOBS 40b2). The VTSI 40b1 contains
control information that is required for playback of this VTS.
The VOBS 40b2 is an aggregate of video objects (VOBs) which are
composed of data such as video, audio and sub-pictures.

Figure 4(c) shows the structure of the VOBS 40b2. The VOBS
40b2 consists of plural VOBs. It is assumed here that the VOBS
40b2 consists of n pieces of VOBs, i.e., VOB(1) 40c1, VOB(2) 40c2,
VOB(3) 40c3, ..., and VOB(n) 40cn.

Figure 4(d) shows the structure of the VOB(1) 40c1. As shown
in figure 4(d), the VOB(1) 40c1 is constituted by plural video
object units (VOBUs). It is assumed here that the VOB(1) is
constituted by m pieces of VOBUs, i.e., VOBU(1) 40d1, VOBU(2) 40d2,
VOBU(3) 40d3, ..., and VOBU(m) 40dm. The DVD video specifications
define that the VOB is constituted by one or more cells and that
the cell is constituted by one or more VOBUs, but to simplify the
descriptions, it is assumed hereinafter that one VOB is constituted
by one cell and no description will be given of the cell.

Figure 4(e) shows the structure of the VOBU(1) 40d1. The
VOBU(1) 40d1 is constituted by packs of a predetermined size. The
DVD video specifications define the pack size as 2048 bytes. The
VOBU(1) 40d1 is constituted by a navigation pack (NV_PCK 40f),
video packs (V_PCK 40g1, V_PCK 40g2, ...), audio packs (A_PCK 40h1,
A_PCK 40h2, ...), and sub-picture packs. In figure 4(e), the
sub-picture packs are not shown.
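
The nesting described with reference to figures 4(a) to 4(e) can be summarized as a minimal Python sketch. The class and field names are hypothetical, the cell layer is omitted as in the description, and the label "SP_PCK" for sub-picture packs is an assumption, since the text does not name it.

```python
from dataclasses import dataclass, field
from typing import List

PACK_SIZE = 2048  # bytes, as defined by the DVD video specifications


@dataclass
class Pack:
    kind: str  # "NV_PCK", "V_PCK", "A_PCK", or (assumed label) "SP_PCK"


@dataclass
class VOBU:
    packs: List[Pack] = field(default_factory=list)

    def size_bytes(self) -> int:
        # A VOBU consists of whole packs only.
        return len(self.packs) * PACK_SIZE


@dataclass
class VOB:
    vobus: List[VOBU] = field(default_factory=list)


@dataclass
class VTS:
    vtsi: dict = field(default_factory=dict)       # control information
    vobs: List[VOB] = field(default_factory=list)  # the VOBS


@dataclass
class Disc:
    vmg: dict = field(default_factory=dict)        # video manager information
    titles: List[VTS] = field(default_factory=list)


# One VOBU holding a navigation pack followed by video and audio packs.
vobu = VOBU([Pack("NV_PCK"), Pack("V_PCK"), Pack("V_PCK"), Pack("A_PCK")])
disc = Disc(titles=[VTS(vobs=[VOB(vobus=[vobu])])])
print(vobu.size_bytes())
```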

The navigation pack (NV_PCK) is placed at the head of each
VOBU. In this navigation pack (NV_PCK), playback information of
this VOBU and management information relating to search information
are recorded. According to the DVD video specifications, it is
defined that the VOBU has a video playback time of 0.4 sec to 1 sec.
However, as for a VOBU that is at the end of a VOB, it is defined
that its playback time is 0.4 sec to 1.2 sec. Here, the data is recorded
after being packed and packetized in compliance with "The system part
of the MPEG-2 standard (ISO/IEC 13818-1)". In the following
descriptions, a method that is compliant with "The system part
of the MPEG-2 standard (ISO/IEC 13818-1)" is referred to as an
MPEG system coding method.

Figure 4(g) shows the correspondence between the video packs
40g1, 40g2, ..., 40g4 among the packs constituting the VOBU(1)
40d1, and a stream of GOPs.

Generally, a VOBU can be constituted by data of one or more
GOPs, while a description will be given here of a case where the
VOBU(1) 40d1 is constituted by a video stream corresponding to
two GOPs. To be more specific, the video stream included in the
VOBU(1) 40d1 is constituted by coded data 40v1 of GOP(1), coded
data 40v2 of GOP(2), and padding data 40v3. Since each VOBU is
constituted by video packs and audio packs each comprising 2048
bytes, the data size of the VOBU is required to be an integral
multiple of 2048 bytes. The padding data 40v3 is added to the video
stream corresponding to one GOP, so that the data size of the video
stream included in the VOBU is matched with an integral multiple
of 2048 bytes.
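
The amount of padding needed to reach an integral multiple of the pack size follows from simple modular arithmetic. The following Python sketch shows the calculation; the function name is hypothetical.

```python
PACK_SIZE = 2048  # bytes per pack, as defined by the DVD video specifications


def padding_needed(stream_size: int) -> int:
    """Bytes of padding that bring stream_size up to a multiple of PACK_SIZE."""
    return (-stream_size) % PACK_SIZE


# A 5000-byte video stream needs 1144 padding bytes to fill three packs.
size = 5000
pad = padding_needed(size)
print(pad, (size + pad) // PACK_SIZE)
```

A stream whose size is already a multiple of 2048 bytes needs no padding.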

Figure 4(h) shows the correspondence between the audio packs
40h1-40h3 among the packs constituting the VOBU(1) 40d1, and a
stream of respective audio frames. The audio packs (A_PCK)
40h1-40h3 which are obtained by packing coded data of audio frames
40k1-40k4 correspond to the audio stream included in the VOBU(1)
40d1. In figure 4(h), the data size of each of the audio frames 40k1-40k4
is an integral multiple of 2048 bytes, but the data size is not
restricted to this. According to the DVD video specifications,
it is required that audio frames should be completed in one VOB,
while it is unnecessary that audio frames should be completed in
one VOBU, and coded data of audio frames may be divided by boundaries
of VOBUs. However, as for the last VOBU in the VOB, it is required
to adjust the data size such that the end of the audio frame matches
the last audio pack of the VOBU, and thus the padding data is inserted.

Figure 4(f) shows a structure of the navigation pack (NV_PCK
40f). The NV_PCK 40f is constituted by a pack header 40f1, a system
header 40f2, a PCI packet 40f3 in which playback information is
described, and a DSI packet 40f4 in which data search information
is described. The pack header 40f1 is constituted by an
identification code (Pack_start_code) for identifying the head
of the pack, time information indicating a timing at which data
is inputted to a decoder (System Clock Reference: SCR), and a
data transfer rate of a compressed stream (program_mux_rate).

The system header 40f2 contains the number of audio streams
and the number of video streams included in the VOBU, the
decoder buffer memory size required for decoding, and the like.

The PCI packet 40f3 contains playback control information
of the VOBU. Examples of the VOBU playback control information
are a leading recording address of a VOBU (NV_PCK_LBN), playback
start time information of a VOBU (VOBU_S_PTM), playback end time
information of a VOBU (VOBU_E_PTM), and the like.

Further, the DSI packet 40f4 contains management information
for searching a VOBU. As the management information, there are
addresses of reference frames included in a VOBU (VOBU_1STREF_EA,
VOBU_2NDREF_EA, VOBU_3RDREF_EA), playback start time information
(VOB_S_PTM) and playback end time information (VOB_E_PTM) of a
VOB including the current VOBU, address information (FWDI, BWDI)
of plural VOBUs which are located temporally forward and backward
of the current VOBU, and the like. As the FWDI and the BWDI, addresses
of the first packs in VOBUs which are located within two minutes
before and after the current VOBU, in a VOB including the current
VOBU, are described. Here, the FWDI is used when high-speed forward
playback is performed, and the BWDI is used when high-speed backward
playback is performed.

As described above, according to the DVD video specifications,
the playback control information of each VOBU can be recognized
by reading the PCI packet information, and the information for
data search can be recognized by reading the DSI packet information.
Therefore, the playback control and the high-speed search can be
realized only by reading data streams, whereby the cost of a playback
apparatus can be reduced without the need of storing search
information in a memory or the like. Thus, the DVD video
specifications define a recording format which facilitates high-speed
data search even with less-expensive playback apparatus.

In figure 4(b), only the structure of the VTS(1) 40a1 is
shown in detail, but the VTSs other than the VTS(1) 40a1, i.e.,
the VTS(2) 40a2 and the VTS(3) 40a3, also have the same structure
as the VTS(1) 40a1. In figure 4(d), only the
structure of the VOB(1) 40c1 is shown in detail, but the VOBs other
than the VOB(1) 40c1, i.e., the VOB(2) 40c2 to VOB(n) 40cn, also
have the same structure as the VOB(1) 40c1. Further,
in figure 4(e), only the structure of the VOBU(1) 40d1 is shown
in detail, but the VOBUs other than the VOBU(1) 40d1, i.e.,
the VOBU(2) 40d2 to VOBU(m) 40dm, also have the same structure as
the VOBU(1) 40d1.

The navigation pack is placed at the head of the VOBU, while 
the playback end time information of the VOBU stored in the PCI 
packet cannot be decided until the number of video frames included 
in the VOBU (GOP structure) is determined. Further, the search 
information stored in the DSI packet cannot be decided until the 
recording address of a VOBU which is two minutes ahead at maximum 
is determined. Further, the playback end time information of a 
VOB including the current VOBU cannot be decided until the VOB 
is determined. Therefore, when data is recorded in real time
in compliance with the DVD video specifications, prior art recording
methods and apparatus have the following various problems.
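
The dependency described above, in which a navigation pack sits at the head of a VOBU yet contains values known only later, can be made concrete with a short Python sketch. The field names follow the identifiers quoted in the description (VOBU_E_PTM, VOB_E_PTM, FWDI); the class itself, its reduced field set, and the method name are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class NavigationPack:
    nv_pck_lbn: int                   # leading recording address of the VOBU
    vobu_s_ptm: int                   # playback start time of the VOBU
    vob_s_ptm: int                    # playback start time of the enclosing VOB
    vobu_e_ptm: Optional[int] = None  # needs the VOBU's final frame count
    vob_e_ptm: Optional[int] = None   # needs the whole VOB to be determined
    fwdi: Optional[List[int]] = None  # needs addresses of later VOBUs

    def unresolved_fields(self) -> List[str]:
        """Names of fields that cannot be written yet."""
        pending = []
        if self.vobu_e_ptm is None:
            pending.append("VOBU_E_PTM")
        if self.vob_e_ptm is None:
            pending.append("VOB_E_PTM")
        if self.fwdi is None:
            pending.append("FWDI")
        return pending


# At the moment a VOBU starts, only the start values are known.
nav = NavigationPack(nv_pck_lbn=0, vobu_s_ptm=0, vob_s_ptm=0)
print(nav.unresolved_fields())
```

Until the list returned by `unresolved_fields()` is empty, the pack cannot be written in its final form, which is the difficulty for real-time recording.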

Hereinafter, these problems will be described in detail.
A prior art recording apparatus which forms VTSs according to the
DVD video specifications (authoring system) previously creates
a compressed stream of video data and audio data of a VTS to be
recorded, decides a VOBU configuration included in the VTS, and
thereafter analyzes the compressed stream in a separate process.
From the analysis of the compressed stream, the playback end time
information of each VOBU, the playback end time information of
VOBs, and recording addresses of VOBUs located forward and backward
of the current VOBU are detected, thereby creating data of a
navigation pack.

Therefore, in order to record a compressed stream to which
the navigation pack data has been inserted, a processing time taken
to compressively code all audio/video signals of a VTS, a processing
time taken to analyze an obtained compressed stream, a processing
time taken to create a navigation pack, and a processing time taken
to insert the navigation pack into the compressed stream are required.
However, in consumer recording devices, which receive
television broadcasts and the like to record/playback the same,
it is required to carry out the above-mentioned processes in real
time. Therefore, the above-mentioned processing times cause
problems.

Further, the same structure as the above-mentioned authoring 
system has a problem of requiring a memory or another recording 
medium having such a capacity that a compressed stream of one VTS 
is stored before analyzing the compressed stream, resulting in 
an increase in costs. This problem may be solved by fixing the 
data sizes of all VOBUs to a predetermined value. According to 
this solution, it is possible to calculate recording addresses 
of VOBUs which are situated forward and backward of a target VOBU 
as the VOBU search information, to be integral multiples of the 
predetermined fixed size. However, in order to fix the data sizes
of the VOBUs, the fixed-rate coding method should be adopted and,
in this case, the degradation in picture quality occurs more easily
as compared to the variable-rate coding method.
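
The fixed-size approach can be illustrated by a short Python sketch: when every VOBU occupies the same number of packs, the address of any neighbouring VOBU is an integral multiple of that size and can be computed without analysing the stream. The function name, the parameters, and the use of a simple index window in place of the two-minute time window are assumptions made for illustration.

```python
def neighbour_addresses(index, vobu_size, count, reach):
    """Addresses of VOBUs up to `reach` positions behind and ahead of `index`.

    Assumptions: addresses start at 0, all VOBUs have the same size
    `vobu_size`, and `count` is the total number of VOBUs in the VOB.
    """
    first = max(0, index - reach)
    last = min(count, index + reach + 1)
    backward = [j * vobu_size for j in range(first, index)]
    forward = [j * vobu_size for j in range(index + 1, last)]
    return backward, forward


# VOBU number 2 of 5, each 100 packs long, looking two VOBUs each way.
back, ahead = neighbour_addresses(index=2, vobu_size=100, count=5, reach=2)
print(back, ahead)
```

The convenience comes at the price noted above, since fixing the size of every VOBU implies fixed-rate coding.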

The playback end time of a VOB included in the DSI packet
cannot be decided until the VOB is determined. This problem can
be solved, for example, by previously recording the end time of the
VOB as a predetermined time (i.e., dividing the VOB periodically
when the VTS is recorded). However, in this solution, even when
the user instructs the recording apparatus to stop recording during
the recording, the recording cannot be stopped until the
predetermined end time of the VOB. Further, when the VTS is recorded
by dividing the VOB periodically, since the DVD video
specifications restrict the maximum number of VOBs, this maximum
number of VOBs should be taken into consideration.

SUMMARY OF THE INVENTION 

The object of the present invention is
to provide a recording method and apparatus which can record
audio/video signals on a recording medium in real time in a recording
format that facilitates high-speed data search.

Another object of the present invention is to provide a 
recording method and apparatus which can compressively encode video 
signals of high picture quality by a variable-rate coding method 
and record the signals on a recording medium in real time in a 
recording format that facilitates high-speed data search. 

Another object of the present invention is to provide a
recording method and apparatus which includes a user interface
that enables an instant stop of recording upon receipt of a recording
stop notice by the user, and which can record audio/video signals
on a recording medium in real time in a recording format that
facilitates high-speed data search.

Still another object of the present invention is to provide 
a recording method and apparatus which can record audio/video 
signals on a recording medium in real time in a recording format 
that is compliant with the DVD video specifications. 

Other objects and advantages of the present invention will
become apparent from the detailed description and specific
embodiments described therein, which are provided only for
illustration since various additions and modifications within the
spirit and scope of the invention will be apparent to those of
skill in the art from the detailed description.

According to a first aspect of the present invention,
there is provided a recording apparatus for recording a compressed
stream that is obtained by compressively coding audio/video signals,
on a recording medium. This recording apparatus includes
an encoder for compressively coding the audio/video
signals and outputting a compressed stream, a recording buffer
memory for storing the compressed stream, a recorder for recording
the compressed stream stored in the recording buffer memory onto
the recording medium, and a system controller for controlling
the respective units. The encoder divides a compressed
stream of audio/video signals within a predetermined time range
to form plural sub-units, and forms a main unit from a group of
these sub-units to output the compressed stream, and
creates sub-unit attribute information concerning the sub-units.
The system controller generates management information
for each of the sub-units from the corresponding sub-unit attribute
information, and inserts the management information in a
predetermined position in the main unit. The recorder
reads the compressed stream successively from the recording buffer
memory when the sub-unit management information has been inserted
into the main unit, and records the read stream on the recording
medium. Therefore, a navigation pack including search information
and playback control information of each sub-unit can be inserted
into a compressed stream in real time, to be recorded on a recording
medium. Accordingly, the audio/video signals can be recorded in
real time on a recording medium of a recording format that is
compliant with the DVD video specifications, as a recording format
which facilitates high-speed data search.
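
The role of the recording buffer memory in the first aspect can be sketched in Python as follows. The sketch assumes that the sub-unit attribute information consists of a data size and a playback duration and that the management information holds addresses and times; these keys, and the function name, are hypothetical and are chosen only to show why a complete main unit is buffered before it is recorded.

```python
def build_management_info(sub_unit_attrs, start_address=0):
    """Derive per-sub-unit management information for one buffered main unit.

    sub_unit_attrs: list of dicts with assumed keys "size" and "duration".
    """
    # First pass: the address of each sub-unit follows from the sizes before it.
    addresses = []
    address = start_address
    for attrs in sub_unit_attrs:
        addresses.append(address)
        address += attrs["size"]

    # Second pass: each entry may refer to later sub-units, which is only
    # possible because the whole main unit is still in the buffer.
    management = []
    elapsed = 0
    for i, attrs in enumerate(sub_unit_attrs):
        management.append({
            "address": addresses[i],
            "start_time": elapsed,
            "end_time": elapsed + attrs["duration"],
            "forward": addresses[i + 1:],
            "backward": addresses[:i],
        })
        elapsed += attrs["duration"]
    return management


attrs = [
    {"size": 10, "duration": 1},
    {"size": 20, "duration": 2},
    {"size": 5, "duration": 1},
]
info = build_management_info(attrs)
print(info[0]["forward"], info[1]["end_time"])
```

Once every entry has been produced, the buffered main unit can be handed to the recorder as a whole.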

According to a second aspect of the present invention,
the recording apparatus is similar to that of the first aspect,
except that the encoder
compressively encodes the audio/video signals by a variable-rate
controlled compression method. Therefore, video signals of high
picture quality and audio signals of high sound quality can be
recorded on a recording medium of a limited capacity.

According to a third aspect of the present invention,
the recording apparatus is similar to that of the first
aspect, except that the maximum number of the main units constituting
the compressed stream is defined as a predetermined value N.
Therefore, the upper limit of a main unit management table can
be restricted, and the memory size for the management table and
the main units can be easily managed.

According to a fourth aspect of the present invention,
the recording apparatus is similar to that of the third
aspect, except that when the number of the main units constituting
the compressed stream reaches the predetermined value N, the
recording of the compressed stream is stopped. Therefore, the
maximum number of main units to be recorded is limited to N, thereby
restricting the upper limit of the main unit management table,
and accordingly the memory size for the management table and the
main units can be easily managed.

According to a fifth aspect of the present invention,
the recording apparatus is similar to that of the first
aspect, except that the system controller forms a main unit set
from a group of the plural main units, and when the maximum number
of the main units included in the main unit set is defined as a
predetermined value n and when the number of the main units
constituting the compressed stream reaches the predetermined value
n, the main unit set is formed from a group comprising the
predetermined value n of the main units, then followed by starting
formation of a next main unit set. Therefore, the maximum number
of the main units within the main unit set is limited to n, thereby
restricting the upper limit of a main unit management table, and
accordingly the memory size for the management table and the main
units can be easily managed.

According to a sixth aspect of the present invention,
the recording apparatus is similar to that of the third
aspect, except that a memory size of the recording buffer memory
is defined by a recording capacity of the recording medium and
the predetermined value N. Therefore, the recording capacity of
the recording medium can be utilized efficiently. Further, the
memory size of the recording buffer memory is minimized, whereby
the cost of the recording apparatus can be reduced.

According to a seventh aspect of the present invention,
the recording apparatus is similar to that of the third
aspect, except that a memory size of the recording buffer memory
is defined by a size of data to be recorded and the predetermined
value N. Therefore, the data can be recorded with reliability.
Further, the memory size of the recording buffer memory is minimized,
whereby the cost of the recording apparatus can be reduced.

According to an eighth aspect of the present invention,
the recording apparatus is similar to that of the first
aspect, except that the encoder forms the main unit from the sub-unit
group when a total data size of the sub-unit group becomes equal
to or larger than a predetermined value M. Therefore, the data
sizes of the main units can be made an almost uniform value (i.e.,
the predetermined value M), whereby the memory size of the recording
buffer memory can be properly decided on the basis of the
predetermined value M.
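
The size-based grouping of the eighth aspect can be sketched in a few lines of Python. The sketch assumes that the encoder sees the data size of each sub-unit as it is produced; the function name and return format are hypothetical.

```python
def group_sub_units(sub_unit_sizes, m):
    """Close a main unit as soon as its accumulated size reaches M.

    Returns the completed main units and the sub-units still being
    accumulated when the input ends.
    """
    main_units = []
    current = []
    total = 0
    for size in sub_unit_sizes:
        current.append(size)
        total += size
        if total >= m:  # total data size equal to or larger than M
            main_units.append(current)
            current = []
            total = 0
    return main_units, current


# With M = 1000, main units close at 1200 and 1100 bytes; 100 bytes remain open.
units, open_group = group_sub_units([300, 400, 500, 200, 900, 100], 1000)
print(units, open_group)
```

Each completed main unit is at least M in size and exceeds M by less than one sub-unit, which is why the sizes come out almost uniform.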

According to a ninth aspect of the present invention,
the recording apparatus is similar to that of the eighth
aspect, except that the predetermined value M is defined by a memory
size of the recording buffer memory. Therefore, when the
predetermined value M is decided so that at least one main unit
can be stored in the recording buffer memory, navigation data of
each sub-unit can be inserted into the recording buffer memory,
whereby the audio/video signals can be recorded on a recording
medium in a recording format that facilitates high-speed data
search.

A tenth aspect of the present invention is similar to the 
recording apparatus of the eighth aspect, except that the maximum 
number of the main units constituting 
the compressed stream is set at a predetermined value N, and the 
predetermined value M is defined by a recording capacity of the 
recording medium and the predetermined value N. Therefore, the 
predetermined value M for efficiently using the recording capacity 
of the recording medium can be decided. Further, the memory size 
of the recording buffer memory decided by the predetermined value 
M can be minimized, whereby the cost of the recording apparatus 
can be reduced. 

An eleventh aspect of the present invention is similar to the 
recording apparatus of the eighth 
aspect, except that the maximum number of the main units constituting 
the compressed stream is set at a predetermined value N, and the 
predetermined value M is defined by a size of data to be recorded 
and the predetermined value N. Therefore, the predetermined value 
M for recording data of a predetermined data size can be decided, 
and the memory size of the recording buffer memory decided by the 
predetermined value M can be minimized, whereby the cost of the 
recording apparatus can be reduced. 

A twelfth aspect of the present invention is similar to the 
recording apparatus of the first aspect, except that the encoder 
detects at least one piece of information among (i) information 
concerning a data size of the sub-unit, (ii) information concerning 
a position of the sub-unit in the main unit, and (iii) information 
concerning a playback time of the sub-unit, as the sub-unit 
attribute information. Therefore, search information and playback 
control information as sub-unit management information can be 
generated by the system controller. 

A thirteenth aspect of the present invention is similar to the 
recording apparatus of the first aspect, except that the system 
controller inserts the sub-unit management information into the 
main unit so as to be placed at a head of each sub-unit. Therefore, 
reading of navigation data as the sub-unit management information 
can be performed simultaneously with the reading of sub-unit data, 
whereby the compressed stream can be recorded in a recording format 
enabling the search information and the playback control 
information to be easily recognized. 

A fourteenth aspect of the present invention recites the 
recording apparatus of the first aspect further including a unit 
for issuing a recording stop command or a recording start command, 
wherein the system controller posts a coding stop instruction to 
the encoder when the recording stop command is issued, and the 
encoder finishes forming the main unit when receiving the coding 
stop instruction, taking a sub-unit that is being formed at a time 
when the instruction is received as a last sub-unit. Therefore, 
the recording of the compressed stream can be stopped at an 
arbitrary timing. 

A fifteenth aspect of the present invention recites the 
recording apparatus of the first aspect further comprising a 
decision unit for deciding the type of the recording medium, 
wherein, on the basis of a result of the decision by the decision 
unit, the system controller either inserts the sub-unit management 
information in a predetermined position in the main unit, or 
controls the recorder to record the sub-unit management 
information in a sub-unit management area on the recording medium. 
Therefore, the audio/video signals can be recorded in a recording 
format according to the type of the recording medium. 

According to a sixteenth aspect of the present invention, 
there is provided a recording method by which a compressed stream 
that is obtained by compressively coding audio/video signals is 
recorded on a recording medium. The recording method includes: 
a coding step of compressively coding the audio/video signals, 
thereby generating a compressed stream, a storage step of storing 
the compressed stream, a recording step of recording the compressed 
stream stored in the storage step on the recording medium, and a 
system control step of controlling the respective steps. In the 
coding step, a compressed stream of audio/video signals within a 
predetermined time range is divided to form plural sub-units, a 
main unit is formed from a group of the sub-units, and sub-unit 
attribute information concerning each sub-unit is created. In the 
system control step, management information of each of the 
sub-units is generated from the corresponding sub-unit attribute 
information, and each piece of the management information is 
inserted in a predetermined position in the main unit. In the 
recording step, the compressed stream in which the sub-unit 
management information has been inserted is successively recorded 
onto the recording medium. Therefore, a navigation pack including 
search information and playback control information of each 
sub-unit can be inserted into a compressed stream in real time, 
to be recorded on the recording medium. Accordingly, the 
audio/video signals can be recorded in real time on a recording 
medium in a recording format that is compliant with the DVD video 
specifications, as a recording format which facilitates high-speed 
data search. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a block diagram illustrating a structure of a 
recording apparatus according to a first embodiment of the present 
invention. 

Figure 2 is a block diagram illustrating a structure of an 
audio/video encoder 101 of the recording apparatus according to 
the first embodiment. 

Figure 3 is a diagram for explaining an operation of a system 
controller 105 of the recording apparatus according to the first 
embodiment, concerning a navigation pack insertion process. 

Figure 4(a) is a diagram illustrating a structure of data 
on a disk when an audio/video stream is recorded on a recording 
medium. 

Figure 4(b) is a diagram illustrating a data structure of 
a video title. 

Figure 4(c) is a diagram illustrating a data structure of 
a video object set. 

Figure 4(d) is a diagram illustrating a data structure of 
a video object. 

Figure 4(e) is a diagram illustrating a data structure of 
a video object unit. 

Figure 4(f) is a diagram illustrating a data structure of 
a navigation pack. 

Figure 4(g) is a diagram illustrating video packs 
constituting a video object. 

Figure 4(h) is a diagram illustrating audio packs constituting 
a video object. 

Figure 5 is a schematic diagram for explaining an operation 
of a video coding unit 202 of the recording apparatus according 
to the first embodiment, showing plural frames constituting one 
GOP, timing of notification of a stop command, and timing when 
the video coding unit 202 stops coding. 

Figure 6 is a schematic view for explaining the operation 
of the video coding unit 202 according to the first embodiment, 
showing plural frames constituting one GOP. 

DETAILED DESCRIPTION OF THE INVENTION 

Hereinafter, an embodiment of the present invention will 
be described. 
[Embodiment 1] 

Figure 1 is a block diagram for explaining a recording 
apparatus according to the first embodiment of the present invention. 
The recording apparatus according to the first embodiment comprises 
an audio/video encoder 101, a recording buffer memory 102, a 
recording processor 103, a recording head 104, a system bus 107, 
a user interface unit 108, a playback head 109, a decision unit 
110, and a system controller 105 for controlling the respective 
constituents, and records inputted audio/video signals on a 
recording medium 106. The decision unit 110 is a means for deciding 
the type of the recording medium 106 on the basis of information 
read by the playback head 109. In this first embodiment, a 
description will be given of the operation of the recording 
apparatus in a case where the decision unit 110 decides that the 
recording medium 106 is a recording medium that is compliant with 
the DVD video specifications. Here, the decision unit 110 decides 
the type of the recording medium 106 on the basis of information 
that is read from an inner radius of the recording medium 106 by 
the playback head 109. The decision unit 110 may decide the type 
of the recording medium 106 on the basis of the amount of reflected 
light (i.e., reflection ratio) that is obtained when the playback 
head 109 irradiates the recording medium 106 with a laser. The 
recording medium can be decided by this method because each type 
of recording medium 106 has different physical properties. 

The audio/video encoder 101 encodes inputted audio/video 
signals. This encoding process generates a compressed stream 
having a sub-unit configuration and a main unit configuration which 
are compliant with the DVD video specifications, and outputs the 
same. The audio/video encoder 101 divides the compressed stream 
into plural sub-units, and constitutes a main unit by a group of 
these sub-units. 

Here, the description will be given assuming that the sub-unit 
corresponds to a VOBU, the main unit corresponds to a VOB, and 
a main unit set that is constituted by a group of the main units 
corresponds to a VTS in the recording apparatus of the first 
embodiment. More specifically, the audio/video encoder 101 
divides the compressed stream to form plural VOBUs, and forms a 
VOB from a group of these VOBUs. Here, one VOB is completed at 
a time when the total data size of a group of VOBUs constituting 
the VOB exceeds a predetermined threshold M, and then a new VOB 
is started from the following VOBU. That is, the data size of the 
VOB is made almost equal to the threshold M. It is assumed that 
the threshold M is smaller than the capacity B of the recording 
buffer memory 102. This is because it is required to store data 
of at least one VOB in the recording buffer memory 102. The threshold 
M is a value that is defined by the system controller 105 and sent 
from the system controller 105 to the audio/video encoder 101. 
The way to obtain the threshold M will be described later. 
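
By way of illustration only, the VOB formation rule described above 
can be sketched as follows. The function name and the use of a plain 
list of VOBU sizes are assumptions made for this sketch and do not 
appear in the specification; the sketch simply closes a VOB once the 
cumulative size of its VOBUs exceeds the threshold M. 

```python
from typing import List


def group_vobus_into_vobs(vobu_sizes: List[int], threshold_m: int) -> List[List[int]]:
    """Group consecutive VOBU sizes into VOBs (hypothetical helper).

    A VOB is closed as soon as the total size of its VOBUs exceeds
    threshold_m, and the next VOBU starts a new VOB.
    """
    vobs: List[List[int]] = []
    current: List[int] = []
    total = 0
    for size in vobu_sizes:
        current.append(size)
        total += size
        if total > threshold_m:
            # The VOBU just added becomes the last VOBU of this VOB.
            vobs.append(current)
            current = []
            total = 0
    if current:
        # Remaining VOBUs form a final, smaller VOB (e.g. at recording stop).
        vobs.append(current)
    return vobs
```

Because a VOB is closed only after the threshold is exceeded, each 
completed VOB in this sketch is slightly larger than M, by less than 
the size of its last VOBU. 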

Further, the audio/video encoder 101 extracts attribute 
information concerning each VOBU, and posts the information to 
the system controller 105. The system controller 105 stores 
the VOBU attribute information. Further, the audio/video encoder 
101 gives a last VOBU notice to the system controller 105. This 
is because the system controller 105 should specify the last VOBU 
in a VOB. 

The recording buffer memory 102 has a memory of a predetermined 
memory size, and retains compressed streams. When one VOB is stored 
in the recording buffer memory 102, the system controller 105 
generates PCI data and DSI data as navigation data, from attribute 
information of respective VOBUs constituting the stored VOB, to 
form a navigation pack. The system controller 105 recognizes that 
one VOB has been stored in the recording buffer memory 102 upon 
receipt of the notice of the last VOBU in the VOB from the audio/video 
encoder 101. Then, after the navigation pack is inserted at the 
top of each VOBU stored in the recording buffer memory 102, the 
system controller 105 performs control so that the compressed 
streams in which the navigation packs have been inserted are 
successively inputted to the recording processor 103. 

The recording processor 103 carries out an error correction 
code addition process and a recording modulation process for the 
compressed stream in which the navigation pack has been inserted, 
and outputs a record signal. The recording head 104 records the 
record signal on the recording medium 106. The user interface unit 
108 is a means by which the user issues a recording start command 
or a recording stop command. The system controller 105 performs 
control of the respective constituents, including the 
above-mentioned operation, and communicates with the respective 
constituents via the system bus 107. 

Here, the VOBU attribute information output from 
the audio/video encoder 101 is described. The VOBU attribute 
information is information that is required to generate navigation 
data. The navigation pack includes information that is required 
for multi-angle playback as a recording form of the DVD video 
specifications, button highlight information on a menu screen, 
information concerning sub-pictures, and the like. In this first 
embodiment, the description will be given only of information for 
recording or playing back a series of sequential VTSs. Here, as 
for the angle information, highlight information, sub-picture 
information and the like, predetermined default values which are 
defined in the DVD video specifications can be recorded, and 
accordingly they will not be described here. 

Examples of the VOBU attribute information detected by the 
audio/video encoder 101 are shown below. 

(1) nv_pck_scr : system clock reference information of a 
navigation pack. This is obtained when respective coded data of 
video and audio are multiplexed to form a VOBU, by reading out 
system clock reference information of a pack that is situated at 
the top of a VOBU. 

(2) vobu_ea : the number of packs in a VOBU. This is obtained by 
counting the number of packs included in a VOBU. 

(3) vobu_s_ptm : playback start time information of a playback 
top video frame in a VOBU. When a Group of Pictures (GOP) at the 
top of the VOBU has, for example, the structure as shown in figure 
6, the playback top video frame corresponds to B frame F(k-2) at 
the top of the GOP. 

(4) vobu_frame : the number of frames in a VOBU. This is obtained 
by counting the number of frames included in a VOBU. 

(5) vobu_1stref_ea : the number of packs from the head of a VOBU 
to a pack including the end of the top reference frame in this 
VOBU. When the GOP at the head of the VOBU has, for example, the 
structure as shown in figure 6, I frame F(k) at the head of the 
GOP corresponds to the top reference frame. 

(6) vobu_2ndref_ea : the number of packs from the head of a VOBU 
to a pack including the end of a second reference frame in this 
VOBU. When a GOP at the top of the VOBU has, for example, the 
structure as shown in figure 6, P frame F(k+3) as a P frame 
occurring first in the GOP corresponds to the second reference 
frame. 

(7) vobu_3rdref_ea : the number of packs from the head of a VOBU 
to a pack including the end of a third reference frame in this 
VOBU. When a GOP at the top of the VOBU has, for example, the 
structure as shown in figure 6, P frame F(k+6) as a secondly occurring 
P frame in the GOP corresponds to the third reference frame. 

(8) a_synca : positional information of an audio pack which is 
played back in synchronization with the playback top video frame 
in the current VOBU. When a GOP at the top of the VOBU has, for 
example, the structure as shown in figure 6, B frame F(k-2) at 
the top of the GOP corresponds to the playback top video frame. 

Examples of parameters of a navigation pack generated by 
the system controller 105 when a series of VTSs are recorded or 
played back are shown below. These parameters can be generated 
using the above-mentioned VOBU attribute information. 

(1) NV_PCK_SCR : system clock reference information of a navigation 
pack. This is the same as nv_pck_scr of the VOBU attribute 
information. 

(2) NV_PCK_LBN : positional information of the head of a VOBU with 
respect to the head of a VOB. This is obtained by cumulatively 
adding the numbers of VOBU packs (vobu_ea) in the past VOBU attribute 
information, from the top of the VOB. 

(3) VOBU_S_PTM : video playback start time information of a VOBU. 
This is the same as vobu_s_ptm of the VOBU attribute information. 

(4) VOBU_E_PTM : video playback end time information of a VOBU. 
This is obtained by adding vobu_s_ptm of the VOBU attribute 
information, and a value that is obtained by converting the number 
of VOBU frames (vobu_frame) into time information. 

(5) VOBU_EA : VOBU size (the number of packs). This is the same 
as vobu_ea of the VOBU attribute information. 

(6) VOBU_1STREF_EA : This is the same as vobu_1stref_ea of the 
VOBU attribute information. 

(7) VOBU_2NDREF_EA : This is the same as vobu_2ndref_ea of the 
VOBU attribute information. 

(8) VOBU_3RDREF_EA : This is the same as vobu_3rdref_ea of the 
VOBU attribute information. 

(9) VOB_V_S_PTM : video playback start time information of a VOB 
including the current VOBU. This is the same as vobu_s_ptm of the 
first VOBU in the VOB. 

(10) VOB_V_E_PTM : video playback end time information of a VOB 
including the current VOBU. This is the same as VOBU_E_PTM of the 
last VOBU in the VOB. 

(11) FWDI : forward search information. This is positional 
information of each VOBU that is situated within two minutes ahead 
at maximum in a VOB including the current VOBU. This is obtained 
from NV_PCK_LBN of all VOBUs included in the VOB. 

(12) BWDI : backward search information. This is positional 
information of each VOBU that is situated within two minutes behind 
at maximum in a VOB including the current VOBU. This is obtained 
from NV_PCK_LBN of all VOBUs included in the VOB. 

(13) A_SYNCA : This is the same as a_synca of the VOBU attribute 
information. 

(14) VOBU_VOB_IDN : a serial number of a VOB from the head of a 
VTS. This is obtained by counting the number of VOBs generated 
by the audio/video encoder 101 from the start of recording. 
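
By way of illustration only, the derivation of several of the above 
parameters from the VOBU attribute information of one complete VOB 
can be sketched as follows. The names VobuAttr and build_nav_params, 
and the conversion constant TICKS_PER_FRAME (which assumes a 90 kHz 
time base and 29.97 frames per second), are assumptions made for this 
sketch and are not taken from the specification. 

```python
from dataclasses import dataclass
from typing import Dict, List

# Assumption for this sketch: 90 kHz time base at 29.97 frames per second.
TICKS_PER_FRAME = 3003


@dataclass
class VobuAttr:
    """Subset of the VOBU attribute information (hypothetical container)."""
    vobu_ea: int      # number of packs in the VOBU
    vobu_s_ptm: int   # playback start time of the playback top video frame
    vobu_frame: int   # number of frames in the VOBU


def build_nav_params(vob: List[VobuAttr]) -> List[Dict[str, int]]:
    """Derive per-VOBU navigation parameters for one complete VOB."""
    params: List[Dict[str, int]] = []
    lbn = 0  # running pack count from the top of the VOB
    for attr in vob:
        params.append({
            "NV_PCK_LBN": lbn,
            "VOBU_EA": attr.vobu_ea,
            "VOBU_S_PTM": attr.vobu_s_ptm,
            # Start time plus the frame count converted into time.
            "VOBU_E_PTM": attr.vobu_s_ptm + attr.vobu_frame * TICKS_PER_FRAME,
        })
        lbn += attr.vobu_ea
    if params:
        vob_start = params[0]["VOBU_S_PTM"]
        vob_end = params[-1]["VOBU_E_PTM"]  # known only once the VOB is closed
        for p in params:
            p["VOB_V_S_PTM"] = vob_start
            p["VOB_V_E_PTM"] = vob_end
    return params
```

In this sketch, VOB_V_E_PTM is filled in only after the whole VOB has 
been seen, which mirrors the need to hold one VOB before the 
navigation packs are completed. 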

Among the parameters of the navigation data, NV_PCK_SCR, 
NV_PCK_LBN, VOBU_S_PTM, VOB_V_S_PTM and BWDI can be obtained at a 
time when a VOBU is formed by the audio/video encoder 101. This 
is because the audio/video encoder 101 can obtain these parameters 
by storing the numbers of packs and playback time information of 
past VOBUs. Also, VOBU_VOB_IDN can be obtained by the audio/video 
encoder 101. This is because VOBU_VOB_IDN can be obtained by 
counting the number of VOBs formed by the audio/video encoder 
101 from the start of the recording. 

Further, VOBU_E_PTM, VOBU_EA, VOBU_1STREF_EA, 
VOBU_2NDREF_EA and VOBU_3RDREF_EA can be generated by the 
audio/video encoder 101 by including a memory for storing data 
of one VOBU in the audio/video encoder 101, and delaying a compressed 
stream output by one VOBU. This is possible because the 
navigation data can be generated by analyzing the data of one VOBU. 

However, VOB_V_E_PTM and FWDI cannot be obtained until a VOB 
is determined. This is because, in order to obtain VOB_V_E_PTM and 
FWDI, at least information for specifying the position of a VOBU 
in the VOB and information for obtaining the playback end time 
information of the VOB are required as the VOBU attribute information. 
Therefore, VOB_V_E_PTM and FWDI are obtained by the system controller 
105 while the VOB is stored in the recording buffer memory 102. 
In the recording apparatus according to the first embodiment, the 
number of packs in the VOBU as the information for specifying the 
position of the VOBU in the VOB is employed as the VOBU attribute 
information, and the playback start time information of the VOBU 
and the number of frames in the VOBU as the information for obtaining 
the playback end time information of the VOB are employed as the 
VOBU attribute information. 
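
By way of illustration only, the reason the forward search 
information needs a complete VOB can be seen in the following 
simplified sketch, which lists, for each VOBU, the positions of the 
later VOBUs in the same VOB that start within two minutes ahead. The 
function name, the 90 kHz time base, and the flat list output are 
assumptions made for this sketch; the actual field layout defined by 
the DVD video specifications is not reproduced here. 

```python
from typing import List, Tuple

# Two minutes expressed in 90 kHz ticks (the time base is an assumption).
SEARCH_WINDOW = 120 * 90000


def forward_search_info(vobus: List[Tuple[int, int]]) -> List[List[int]]:
    """Simplified forward search information for one complete VOB.

    vobus holds one (nv_pck_lbn, vobu_s_ptm) pair per VOBU, in playback
    order. The result holds, per VOBU, the positions of later VOBUs
    that start no more than two minutes ahead.
    """
    result: List[List[int]] = []
    for i, (_, start) in enumerate(vobus):
        ahead = [lbn for lbn, s in vobus[i + 1:] if s - start <= SEARCH_WINDOW]
        result.append(ahead)
    return result
```

Every entry refers to VOBUs that follow the current one, so none of 
the entries can be finalized until the last VOBU of the VOB is known. 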

As described above, when part of the navigation data is 
obtained by the audio/video encoder 101, a provisional navigation 
pack is inserted at the top of the VOBU. In this case, the 
audio/video encoder 101 writes part of the generated navigation 
data in the provisional navigation pack. Then, the system 
controller 105 generates navigation data other than the navigation 
data that has been generated by the audio/video encoder 101, and 
forms a complete navigation pack while the VOB data is stored in 
the recording buffer memory 102. Consequently, the amount of 
processing by the system controller 105 can be reduced. 

Next, the audio/video encoder 101 will be described in more 
detail with reference to figure 2. The audio/video encoder 101 
shown in figure 2 comprises an encoder control unit 201 for 
performing control of respective constituents and control of 
communication with external means, a video coding unit 202, an 
audio coding unit 203, a system encoder 204, a VOBU formation unit 
205, a VOBU information extraction unit 206, a VOB size detection 
unit 207 and a VOB formation unit 208, and outputs a compressed 
stream having a VOB configuration according to the DVD video 
specifications. 

The video coding unit 202 compressively encodes an inputted 
video signal and outputs a video stream. Here, the description 
will be given of a case where video signals are compressively encoded 
by the MPEG coding method. Figure 6 is a diagram illustrating an 
example of a GOP configuration of a video signal which has been 
compressively encoded by the MPEG coding method and the coding 
type of each frame. The audio coding unit 203 encodes an inputted 
audio signal and outputs an audio stream. The audio coding unit 
203 can compressively encode audio signals according to the Dolby 
AC3 coding method, the MPEG audio coding method or the like, and 
output coded data. Alternatively, the audio coding unit 203 
can encode audio signals according to a non-compressive coding 
method such as linear PCM, and output coded data. 

The system encoder 204 multiplexes the video stream and the 
audio stream. Here, the system encoder 204 packetizes and packs the 
video stream and audio stream in compliance with the MPEG system 
coding method, thereby creating video packs and audio packs, and 
further multiplexes the video packs and the audio packs in compliance 
with the MPEG system coding method. The system encoder 204 performs 
the packing by carrying out a padding process for the video stream 
and the audio stream so that the VOBU data sizes become integral 
multiples of 2048 bytes. 
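
By way of illustration only, the amount of padding needed to bring a 
data size up to an integral multiple of 2048 bytes can be computed as 
in the following sketch. The function names are assumptions made for 
this sketch, and the sketch does not model where the padding is 
placed within a pack. 

```python
PACK_SIZE = 2048  # bytes per pack, as stated in the description


def padding_needed(data_size: int) -> int:
    """Number of padding bytes that rounds data_size up to a pack boundary."""
    return (-data_size) % PACK_SIZE


def padded_size(data_size: int) -> int:
    """Data size after padding; always an integral multiple of PACK_SIZE."""
    return data_size + padding_needed(data_size)
```

For example, 5000 bytes of data would need 1144 bytes of padding to 
occupy exactly three packs (6144 bytes). 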

Next, the VOBU formation unit 205 divides the multiplexed 
stream into VOBUs. As can be seen in figure 4(g) showing the 
relationship between the video packs as constituents of each VOBU 
and GOPs, one VOBU is constituted by video data of one or more 
GOPs. Further, the VOBU is formed so that the playback time of 
the video data included in the VOBU is from 0.4 sec to 1.0 sec. 

The relationship between the audio packs included in the 
multiplexed stream, which are divided into VOBUs on the basis of 
GOP data of video, and coded data of audio frames, is the same 
as that shown in figure 4(h). 

The VOBU information extraction unit 206 detects attribute 
information of each VOBU formed by the VOBU formation unit 205. 
The detected VOBU attribute information is posted to the system 
controller 105 via the encoder control unit 201 and the system 
bus 107. The VOBU attribute information is the same as that 
described above. 

Here, it is possible that the VOBU formation unit 205 generates 
a provisional navigation pack and the system encoder 204 inserts 
the provisional navigation pack. This is because PCI data and DSI 
data of the navigation pack cannot be obtained until the 
configuration of a VOB is determined, while predetermined fixed 
navigation data can be easily generated by the audio/video encoder 
101. Further, as described above, it is also possible to store 
past VOBU attribute information, generate navigation data which 
can be obtained from the stored VOBU attribute information, and 
insert the generated navigation data in a provisional navigation 
pack. 

The VOB size detection unit 207 cumulatively adds the data 
sizes of a group of VOBUs constituting a VOB, thereby detecting 
the data size of the VOB. The VOB formation unit 208 forms a VOB 
from the VOBU group. When detecting that the data size of the VOB 
that is being formed has exceeded a predetermined threshold M, 
the VOB formation unit 208 decides the last VOBU in the VOB and 
closes the VOB. Then, the VOB formation unit 208 starts to form 
a new VOB from a VOBU following the last VOBU. When the VOB formation 
unit 208 completes outputting of the last VOBU to the recording 
buffer memory 102, a last VOBU notice is sent to the system controller 
105. The last VOBU notice to the system controller 105 is sent 
by the encoder control unit 201 via the system bus 107. 

Here, it is desirable that the VOB formation unit 208 decides, 
as the last VOBU, the VOBU that is being formed at a time when the 
data size of the VOB has exceeded the predetermined threshold M, 
but a little delay may occur after it detects that the data size 
has exceeded the threshold M until the last VOBU is decided. This 
delay depends on the memory size of the recording buffer memory 
102. The method for deciding the memory size of the recording buffer 
memory 102 will be described later. 

Next, the system controller 105 of the recording apparatus 
according to the first embodiment will be described in more detail 
with reference to figure 3. Figure 3 is a block diagram illustrating 
a structure of the system controller 105. With reference to figure 
3, the operation of the system controller 105 concerning the process 
for inserting the navigation pack will be described. As shown in 
figure 3, the system controller 105 comprises a recording buffer 
data reading unit 301, a VOBU control unit 302, a navigation data 
generation unit 303, and a navigation pack writing unit 304. 

The VOBU control unit 302 stores the VOBU attribute 
information posted by the audio/video encoder 101. The 
VOBU attribute information of at least one VOB is stored. The last 
VOBU in the VOB can be recognized by the last VOBU notice posted 
by the audio/video encoder 101. Upon receipt of the last VOBU notice, 
the navigation data generation unit 303 generates navigation data 
from the VOBU attribute information belonging to the VOB. The 
method for generating the navigation data using the VOBU attribute 
information is the same as that described above. For example, 
VOBU_EA and NV_PCK_LBN are obtained from the VOBU size (vobu_ea) 
of the VOBU attribute information. 

The recording buffer data reading unit 301 obtains the address 
of the top pack of a VOBU in the recording buffer memory 102 on 
the basis of the number of packs of each VOBU included in the VOBU 
attribute information. Since the data size of each pack is 2048 
bytes, the address of the top pack of the VOBU can be obtained 
when the address at the start of recording is taken as an offset 
address. 
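
By way of illustration only, the address calculation described above 
can be sketched as follows. The function name is an assumption made 
for this sketch, and the sketch further assumes a linearly addressed 
buffer (no wrap-around) and pack counts that already include any 
provisional navigation pack. 

```python
from typing import List

PACK_SIZE = 2048  # bytes per pack, as stated in the description


def vobu_top_addresses(vobu_pack_counts: List[int], start_address: int) -> List[int]:
    """Address of the top pack of each VOBU in the recording buffer.

    start_address is the buffer address at the start of recording and
    serves as the offset; each VOBU begins where the previous one ends.
    """
    addresses: List[int] = []
    address = start_address
    for packs in vobu_pack_counts:
        addresses.append(address)
        address += packs * PACK_SIZE
    return addresses
```

A buffer that wraps around would additionally need the addresses to 
be reduced modulo the buffer size, which this sketch omits. 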

The navigation pack writing unit 304 forms a navigation pack 
from the navigation data, and inserts the formed navigation pack 
in a position at the address of the top pack of the VOBU in the 
recording buffer memory 102. Here, when a provisional navigation 
pack has been inserted by the audio/video encoder 101, only 
navigation data generated by the navigation data generation unit 
303 is written in a predetermined position in the provisional 
navigation pack. 

Here, according to the DVD video specifications, as shown 
in figure 4(a), it is necessary to also record the VMG 40a, which 
includes management information of each VTS. This VMG 40a is generated 
by the system controller 105 using the VOBU attribute information 
posted by the audio/video encoder 101 and the like, and recorded 
onto a predetermined position on the recording medium 106. 

As described above, according to this first embodiment, 
although a delay depending on the memory size of the recording 
buffer memory 102 occurs, a compressed stream, into which the 
navigation pack has been inserted, can be recorded on a recording 
medium in real time. Therefore, data of the recording format that is 
compliant with the DVD video specifications as a recording format 
that facilitates high-speed data search can be recorded on a 
recording medium in real time. 

Here, in the recording apparatus of the present invention, 
the audio/video encoder 101 may compressively encode video signals 
by the variable-rate controlled compressive coding method. 
According to the variable-rate controlled compressive coding 
method, the size of each VOBU (the number of packs) is varied 
depending on input signals, while in the recording apparatus 
of the present invention, the audio/video encoder 101 posts the 
number of packs of each VOBU to the system controller 105. Therefore, 
even when the video signals are compressively coded by the 
variable-rate controlled compressive coding method, the system 
controller 105 can generate the navigation pack. According to the 
variable-rate controlled compressive coding method, the video 
signal is compressively coded at a bit rate depending on the degree 
of difficulty in coding of the inputted video signal, whereby even 
when the average rate is reduced to a lower level, video of high 
picture quality can be obtained. Therefore, video signals of high 
picture quality can be recorded on a recording medium having a 
limited recording capacity. 

Similarly, audio signals can be compressively coded by the 
variable-rate compressive coding method. In this case, audio 
signals of high sound quality can be recorded. When audio signals 
are coded by the variable-rate compressive coding method, the 
audio/video encoder 101 detects the degree of difficulty in the 
coding of an inputted signal in a prescribed period, decides the 
amount of bits to be assigned according to the degree of difficulty 
in the coding, and encodes the inputted signal according to the 
obtained amount of bits. 
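
By way of illustration only, one possible way of deciding the amount 
of bits according to the degree of difficulty is sketched below. The 
description does not state how the amount of bits is derived from the 
degree of difficulty; the proportional rule, the function name, and 
the integer difficulty scores used here are assumptions made solely 
for this sketch. 

```python
from typing import List


def allocate_bits(difficulties: List[int], total_bits: int) -> List[int]:
    """Split total_bits over periods in proportion to coding difficulty.

    difficulties holds one non-negative score per prescribed period.
    Proportional allocation is an assumed rule, not the described method.
    """
    if not difficulties:
        return []
    total = sum(difficulties)
    if total == 0:
        # No period is harder than another, so share the bits equally.
        return [total_bits // len(difficulties)] * len(difficulties)
    return [total_bits * d // total for d in difficulties]
```

Integer division is used here, so a few bits may remain unassigned; 
a practical encoder would carry such a remainder into a later period. 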

Further, in this recording apparatus of the present invention, 
it is also possible that the system controller 105 defines the 
maximum value of the number of VOBs constituting one VTS as a 
predetermined value N and posts this predetermined value N to 
the audio/video encoder 101. When the maximum number of VOBs is 
limited to decide an upper limit of the VOB management table, the 
memory size for the VOB management table is limited, and the VOB 
management is easily performed. 

According to the DVD video specifications , the maximum number 
of cells in one VTS is limited to 255. Since a VOB is constituted 
by at least one cell, consequently the maximum number of VOBs is 
limited to 255. Therefore, according to the DVD video 
specifications, it is decided that the predetermined value N = 
255, and the recording of one VTS is stopped at a time when the 
number of VOBs reaches the maximum number N. Accordingly, the 
recording can be performed without exceeding the predetermined 
maximum number of VOBs. Further, since the DVD video 
specifications define the maximum number of VOBs as 255, it is 
also possible that the audio/video encoder 101 defines the 
predetermined value N as 255.

In order to stop recording at a time when the number of VOBs
reaches the maximum number, it is required to monitor the number 
of VOBs. This monitoring of the number of VOBs is carried out by 
the system controller 105 by counting the last VOBU notice. The 
system controller 105 posts a recording stop command to the 
respective constituents when the number of VOBs reaches the maximum 
value, and then the recording apparatus stops the VTS recording. 
Here, when the audio/video encoder 101 defines the predetermined 
value N, the audio/video encoder 101 may monitor the number of 
VOBs. 

It is also possible that the recording of one VTS is finished 
at a time when the number of VOBs reaches the maximum value, and 
thereafter recording of a new VTS is continuously carried out. 
That is, it is possible to divide the VTS to be recorded. By doing 
so, the VTSs can be recorded without interrupting the recording
and without exceeding the maximum number of VOBs. Here, the division of
the VTS is carried out by the system controller 105. 
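
The counting operation described above may be sketched, for illustration only, as follows. The sketch assumes that the system controller 105 receives one last VOBU notice per completed VOB. The class name VobCounter, the method on_last_vobu_notice and the returned strings are hypothetical and do not appear in the specification.

```python
# Illustrative sketch only; all names below are hypothetical.
MAX_VOBS_PER_VTS = 255  # predetermined value N under the DVD video specifications


class VobCounter:
    """Counts last VOBU notices and reports when one VTS has reached N VOBs."""

    def __init__(self, max_vobs=MAX_VOBS_PER_VTS, split_vts=False):
        self.max_vobs = max_vobs
        self.split_vts = split_vts  # True: continue in a new VTS instead of stopping
        self.vob_count = 0          # VOBs completed in the current VTS
        self.vts_index = 0          # index of the VTS currently being recorded

    def on_last_vobu_notice(self):
        """Handle one last VOBU notice; return 'continue', 'new_vts' or 'stop'."""
        self.vob_count += 1
        if self.vob_count < self.max_vobs:
            return "continue"
        if self.split_vts:
            # Finish the current VTS and carry on recording in a new VTS.
            self.vts_index += 1
            self.vob_count = 0
            return "new_vts"
        # The recording stop command would be posted at this point.
        return "stop"
```

With split_vts set to False the sketch corresponds to stopping the VTS recording at N VOBs; with split_vts set to True it corresponds to dividing the VTS and continuing the recording.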

In this recording apparatus of the present invention, the 
memory size B of the recording buffer memory 102 can be defined 
on the basis of the recording capacity of the recording medium 
106 and the maximum number N of VOBs included in the VTS. For example,
assuming that the memory size of the recording buffer memory 102 
is b, since it is necessary to store at least one VOB in the recording 
buffer memory 102, the maximum data size of the VOB becomes b. 

Therefore, assuming that the maximum number of VOBs in the
VTS is N, the maximum data size V of one VTS is:

V = N x b ... (1)

Here, assuming that the recording capacity of the recording medium
106 is C, when

C > V ... (2),

since the data size of one VTS recorded on the recording medium 
does not reach the recording capacity of the recording medium, 
only part of the recording capacity of the recording medium 106 
is used, whereby the recording capacity cannot be utilized 
effectively. Therefore, it is difficult to record audio/video data
of high audio/picture quality. 

On the other hand, when the memory size of the recording 
buffer memory 102 is sufficiently increased, the recording capacity 
of the recording medium 106 can be utilized effectively, while 
the costs of the recording apparatus are adversely increased. Thus, 
the memory size B of the recording buffer memory 102 is calculated 
using the recording capacity C of the recording medium 106 and 
the maximum number of VOBs N, as follows: 

B = C/N ... (3) 

The memory size of the recording buffer memory 102, calculated
as described above, is the minimum memory size that is required 
to record one VTS using all of the recording capacity. When the 
memory size is decided as described above, the recording capacity 
of the recording medium can be utilized without loss, whereby the 
audio/video data of high audio/picture quality can be recorded.
Further, when the memory size of the recording buffer memory 102 
is minimized, the cost of the recording apparatus can be reduced. 

Here, the memory size obtained by Formula (3) is the minimum 
required amount. The memory size of the recording buffer memory 
102 can be decided so as to have a margin m. That is, the memory 
size B is decided as follows: 

B = C/N + m ... (4)

The margin m is obtained by taking variations in the VOB data
size into consideration. This is because it is necessary to store 
at least one VOB in the recording buffer memory 102. 

Though the end of a VOB is decided at a time when the VOB 
data size has exceeded the predetermined threshold M as described 
above, the VOB size has variations because the VOB size is detected 
in VOBU units. The maximum variation is decided by the possible 
maximum data size of one VOBU. For example, assuming that the 
maximum recording data rate is Rt and the maximum time
of a VOBU is T, the maximum size of the VOBU is (Rt x T) and the
maximum variation is (Rt x T). Here, in the DVD video specifications,
Rt = 10.08 Mbps and T = 1.0 sec. 

Further, a delay time may occur from when it is detected that
the VOB size has exceeded the threshold M until the last VOBU is
decided. The amount of data occurring during the delay time may
be a cause of the variations in the VOB size. Assuming that this
delay time is d, the maximum size of the occurring data is obtained
from the maximum recording data rate Rt as (Rt x d).
Therefore, the margin m is obtained by

m = Rt x (T+d) ... (5)

From the foregoing, the memory size B of the recording buffer 
memory 102 is obtained by 

B = C/N + Rt x (T+d) ... (6)

Here, the memory size of the recording buffer memory 102 obtained
as described above is defined as the minimum value, and it may
be larger.
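
As a worked illustration of Formulas (5) and (6), the following sketch computes the margin m and the minimum memory size B. It assumes that the capacity C is expressed in bytes and the rate Rt in bits per second, so that Rt is divided by 8. The function names and parameter names are hypothetical.

```python
# Illustrative sketch only; function and parameter names are hypothetical.
# Assumed units: capacity in bytes, rate in bits per second, times in seconds.

def margin_bytes(max_rate_bps, max_vobu_sec, delay_sec):
    """Formula (5): m = Rt x (T + d), converted from bits to bytes."""
    return (max_rate_bps / 8) * (max_vobu_sec + delay_sec)


def buffer_size_bytes(capacity_bytes, max_vobs, max_rate_bps, max_vobu_sec, delay_sec):
    """Formula (6): B = C/N + Rt x (T + d)."""
    return capacity_bytes / max_vobs + margin_bytes(max_rate_bps, max_vobu_sec, delay_sec)
```

For instance, with an assumed capacity C of 4.7 x 10^9 bytes, N = 255, Rt = 10.08 Mbps, T = 1.0 sec and an assumed delay time d of 0.5 sec, C/N is about 18.4 Mbytes and the margin m is about 1.9 Mbytes, so that B is about 20.3 Mbytes. The capacity and the delay time in this instance are assumptions and are not values given in the specification.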

In this recording apparatus of the present invention, while 
the recording of VTSs on the recording medium 106 is started after 
the navigation pack is inserted in a VOB that is stored in the 
recording buffer memory 102, the compressed stream is output
continuously from the audio/video encoder 101 to the recording 
buffer memory 102 also in the recording process. Therefore, it 
is required to store a VOB that is being recorded while storing 
newly formed VOBs. Accordingly, the memory size is required to 
have such a margin that VOB data of a time period that is required 
for the recording process can be stored. 

Further, the time that is required to insert the navigation 
pack should also be considered, but this navigation pack insertion 
time can be considered as part of the recording time. 

In this recording apparatus of the present invention, the 
memory size of the recording buffer memory 102 may be decided by 
the recording data size of the VTS. The recording data size of
the VTS is obtained on the basis of the recording rate and the
recording time. As described above, assuming for example that the
memory size of the recording buffer memory 102 is b and the maximum 
number of VOBs in a VTS is N, the maximum recordable data size 
V of one VTS is obtained by V = N x b as shown by Formula (1).
When the size of data that is to be recorded is larger than the 
maximum recordable data size V, the data cannot be recorded. 

On the other hand, when the memory size of the recording 
buffer memory 102 is sufficiently increased, the maximum recordable 
data size V is increased, but the cost of the recording apparatus 
is adversely increased. Thus, the memory size B of the recording 
buffer memory 102 is calculated using the recording data size D 
and the maximum number of VOBs N, as follows: 

B = D/N ... (7) 

The memory size of the recording buffer memory 102, calculated
as described above, is the minimum required memory size that enables
recording of the data size D. Therefore, the minimum memory size
can be decided, and the cost of the recording apparatus can
be reduced. Here, the memory size obtained by Formula (7) is the
minimum required amount, and thus the memory size of the recording 
buffer memory 102 may be decided so as to include the margin m. 
To be more specific, the memory size B can be obtained as follows: 

B = D/N + m ... (8)

The margin m can be obtained in the same way as described above.

In the recording apparatus of the present invention, the 
threshold M for the VOB division defined by the system controller
105 may be decided on the basis of the maximum number of VOBs in
the VTS and the recording capacity of the recording medium 106.
The generated VOB size is almost equal to the threshold M. Therefore,
assuming that the maximum number of VOBs in one VTS is N, the maximum
data size of the VTS is (M x N). Thus, assuming the recording
capacity of the recording medium 106 is C, the threshold M is obtained 
by 

M = C/N ... (9) 

The threshold M calculated as described above is a minimum value
that is required to record one VTS by using all of the recording 
capacity. When the threshold M is smaller than C/N, the maximum 
data size of one VTS is smaller than the recording capacity of 
the recording medium 106, so that part of the recording capacity 
is left unused, whereby the recording medium 106 cannot be used 
effectively. On the other hand, when the threshold M is 
sufficiently larger than C/N, the recording capacity of the 
recording medium 106 can be used effectively, but the VOB size 
is increased, whereby it is required to increase the memory size 
of the recording buffer memory 102, resulting in an increase in 
the cost of the recording apparatus. Therefore, when the threshold
M is obtained by M = C/N, the efficiency becomes the highest. 

Alternatively, the threshold M may be decided on the basis 
of the recording data size of the VTS. The recording data size 
of the VTS is obtained from the recording rate and the recording
time. Assuming that the maximum number of VOBs in one VTS is N
and the recording data size is D, the threshold M is obtained by

M = D/N ... (10)

The threshold M which is calculated as described above is the minimum
value required to record data of the data size D. When the threshold 
M is smaller than D/N, the recordable data size becomes smaller 
than the recording data size D, whereby the data cannot be recorded. 
On the other hand, when the threshold M is sufficiently larger 
than D/N, the recordable data size can be increased, but the VOB 
size is increased, so that the memory size of the recording buffer 
memory 102 is required to be increased, whereby the cost of the 
recording apparatus is increased. Therefore, when the threshold 
M is obtained by M = D/N, the efficiency becomes highest. 
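
Formulas (9) and (10) may be illustrated by the following sketch. It assumes that the recording data size D is the product of the recording rate and the recording time, with the rate given in bits per second and sizes given in bytes. The function names are hypothetical.

```python
# Illustrative sketch only; function and parameter names are hypothetical.
# Assumed units: sizes in bytes, rate in bits per second, time in seconds.

def threshold_from_capacity(capacity_bytes, max_vobs):
    """Formula (9): M = C/N."""
    return capacity_bytes / max_vobs


def threshold_from_recording(rate_bps, duration_sec, max_vobs):
    """Formula (10): M = D/N, where D = recording rate x recording time."""
    data_size_bytes = rate_bps / 8 * duration_sec
    return data_size_bytes / max_vobs
```

A threshold smaller than the returned value would leave part of the capacity unused or make the data size D unrecordable, while a larger threshold would require a larger recording buffer memory 102, as explained above.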

It is also possible to define the threshold M on the basis 
of the memory size of the recording buffer memory 102. Since at 
least one VOB is stored in the recording buffer memory 102, the 
threshold M is decided so that the memory of the recording buffer 
memory 102 does not overflow. Alternatively, it is possible that 
the threshold M is decided and thereafter the memory size of the 
recording buffer memory 102 is decided according to the threshold 
M. 

Further, in this recording apparatus of the present invention, 
when a recording stop command is issued by the user or the like 
at an arbitrary time, the audio/video encoder 101 notifies the 
system controller 105 of a VOBU that is being formed as the last 
VOBU in a VOB, and then the system controller 105 receives the
notice and controls the respective constituents, thereby
completing the recording of the VOB and finishing the recording
of the VTS. In this case, the VOB size may be
considerably smaller than the threshold M for the VOB division,
but since the notice of the attribute information of each VOBU 
and the last VOBU notice are given to the system controller 105, 
the system controller 105 can generate the navigation data 
regardless of the data size of the VOB. By the above-mentioned 
operation, the VTS can be recorded in real time in a recording 
format that is compliant with the DVD video specifications, while 
recording can be stopped at an arbitrary time. 

The DVD video specifications define that the video playback 
time of the last VOBU in a VOB is from 0.4 sec to 1.2 sec. Therefore,
for example, when a video signal to be coded is a video signal
according to the NTSC system, one VOBU requires at least 12 frames. 
In this case, in the recording apparatus of the present invention, 
when the number of frames in the last VOBU does not reach 12 frames 
at a time when the recording stop command is posted, the coding 
process is continued so that the number of frames in the last VOBU 
becomes at least 12 frames, then the coding process is stopped, 
and the recording process is stopped at a time when the recording 
of the last VOBU is completed. 

Stop timing of the coding process will be described with 
reference to figure 5. Figure 5 is a diagram showing time flows 
of video frames. In figure 5, I3 and I18 are I frames, and P-1,
P6, P9, P12 and P15 are P frames. B-2, B1, B2, B4, B5, B7, B8,
B10, B11, B13, B14, B16, B17, B19 and B20 are B frames. Here, the B1
frame is the top frame in a VOBU 500.

It is assumed that a recording stop command is posted 
immediately before the B10 frame. When the recording is stopped in
accordance with the recording stop command, taking the B10 frame
as the last frame, the number of frames in the last VOBU 500 is
10 frames and the playback time does not reach 0.4 sec. Thus, as
shown in figure 5, the recording is stopped, for example, taking the
P15 frame as the last frame. Accordingly, the number of frames
in the last VOBU 500 becomes 15 frames, whereby the playback time
exceeds the minimum playback time of the VOBU according to the
DVD video specifications, i.e., 0.4 sec. Here, the recording
is stopped at a time when the VOBU reaches 15 frames, but the
recording stop timing can be any time as long as the playback time
of the last VOBU is from 0.4 sec to 1.2 sec.
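
The frame-count condition on the last VOBU may be sketched as follows. The sketch assumes a nominal frame rate of 30 frames per second for the NTSC system, which gives the 12-frame minimum stated above. The function names are hypothetical, and the sketch does not model which frame type is chosen as the last frame.

```python
# Illustrative sketch only; names are hypothetical and 30 fps is an assumption.
import math
from fractions import Fraction

MIN_PLAYBACK_SEC = Fraction(4, 10)   # 0.4 sec, minimum playback time of the last VOBU
MAX_PLAYBACK_SEC = Fraction(12, 10)  # 1.2 sec, maximum playback time of the last VOBU


def last_vobu_frame_limits(fps=30):
    """Return (minimum, maximum) frame counts allowed in the last VOBU."""
    return math.ceil(MIN_PLAYBACK_SEC * fps), math.floor(MAX_PLAYBACK_SEC * fps)


def extra_frames_needed(frames_in_last_vobu, fps=30):
    """Frames still to be coded after a stop command before coding may stop."""
    min_frames, _ = last_vobu_frame_limits(fps)
    return max(0, min_frames - frames_in_last_vobu)
```

In the example of figure 5, a last VOBU holding 10 frames would need at least 2 more frames under these assumptions, whereas a VOBU holding 15 frames already satisfies the minimum.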

In the first embodiment, as an example of the recording format
of the recording medium, the format that is compliant with the
DVD video specifications has been described. However, this is only
an example and the present invention is not limited to this format.
For example, the DVD video recording specifications (Vers. 1.0), which
were issued in September 1999, are predicated upon audio/video
signals being recorded on a DVD in real time. Since the DVD
video recording specifications are predicated upon real-time
recording, when basically only past information is employed,
audio/video signals can be recorded in a recording format that 
is compliant with the DVD video recording specifications. 
According to these specifications, it is unnecessary to insert a
navigation pack at the head of a VOBU, unlike under the DVD video
specifications, and thus the above-mentioned operation of
temporarily storing VOB data and then recording the data is not
required. That is, the memory size of the recording buffer memory
102 can be limited to a size that is smaller than that required for
recording according to the DVD video specifications. In the DVD video
recording specifications and the DVD video specifications, 
audio/video signals are recorded on different types of recording 
media, respectively. A recording medium that is compliant with 
the DVD video recording specifications is for example a DVD-RAM. 
A recording medium that is compliant with the DVD video 
specifications is for example a DVD-R. 

An operation of the recording apparatus according to the 
present invention, in a case where audio/video signals are recorded 
on a recording medium, switching between a recording format that 
is compliant with the DVD video specifications and a recording 
format that is compliant with the DVD video recording specifications, 
will be described hereinafter. Initially, the type of the 
recording medium is decided by the decision unit 110. For example,
when the decision unit 110 decides that the recording medium 106
is a DVD-R, the system controller 105 inserts a navigation pack
at the head of a VOBU as described above. On the other hand, for
example, when the decision unit 110 decides that the recording
medium 106 is a DVD-RAM, the recording processor 103 records a 
VTS in which no navigation pack has been inserted on the recording 
medium 106 and, after the recording of the VTS is finished, further 
records navigation data generated by the system controller 105 
in a VOBU management area on the recording medium 106. The recording
processor 103 carries out the above-mentioned respective recording 
processes under the control of the system controller 105. 
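
The switching operation described above may be sketched as follows, for illustration only. The enumeration Medium, the function navigation_handling and the returned strings are hypothetical names introduced for this sketch, and only the two media named in the text are modeled.

```python
# Illustrative sketch only; all names below are hypothetical.
from enum import Enum


class Medium(Enum):
    DVD_R = "DVD-R"      # example medium for the DVD video specifications
    DVD_RAM = "DVD-RAM"  # example medium for the DVD video recording specifications


def navigation_handling(medium):
    """Select how navigation information is handled for the decided medium type."""
    if medium is Medium.DVD_R:
        # A navigation pack is inserted at the head of each VOBU before recording.
        return "insert_navigation_pack_at_vobu_head"
    if medium is Medium.DVD_RAM:
        # The VTS is recorded without navigation packs; navigation data is then
        # recorded in a VOBU management area after the VTS recording is finished.
        return "record_navigation_data_in_management_area"
    raise ValueError(f"unsupported medium: {medium!r}")
```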

As described above, the recording apparatus of the present 
invention includes the decision unit 110 for deciding the type 
of a recording medium, and the system controller 105 selects,
on the basis of the decision result of the decision unit 110, either
inserting a navigation pack at the head of a VOBU or controlling
the recording processor 103 to record navigation data in a
VOBU management area on the recording medium 106. Therefore,
audio/video signals can be recorded on a recording medium conforming
to a recording format that is compliant with the DVD video recording 
specifications or to a recording format that is compliant with 
the DVD video specifications. 

Further, in the foregoing descriptions, as examples of the 
recording format of a recording medium, the recording format that 
is compliant with the DVD video specifications and the recording 
format that is compliant with the DVD video recording specifications 
have been described. However, the present invention is not limited 
to these recording formats, and any recording format can be employed
as long as the data structure has a configuration of VOBUs and
VOBs and information such as playback control information or search 
control information is placed in a predetermined position in each 
VOBU. 

In the first embodiment, the operation in the case where 
audio/video signals are recorded on a recording medium has been 
described, but the audio/video signals may be recorded in a 
recording/playback apparatus. 

Further, in the first embodiment, the operation in the case 
where audio/video signals are recorded on a recording medium has 
been described, but the present invention is not limited to this 
case and the audio/video signals may be played back while being 
recorded on a recording medium (simultaneous recording and 
playback) when the recording rate of the recording medium and the 
playback rate of the recording medium are sufficiently higher than 
the data rate. In the case of simultaneous recording and playback, 
not only the recording buffer memory but also a playback buffer 
memory is required. However, when audio/video signals are recorded
in the recording format that is compliant with the DVD video 
recording specifications as described above, since it is 
unnecessary to temporarily store VOB data in a recording buffer 
memory, a memory space that is used for the recording process of 
the recording buffer memory can be reduced, and when a memory space 
that is not used for the recording process is allocated to the 
playback process, the recording buffer memory can be used as a
playback buffer memory. Accordingly, the simultaneous recording
and playback can be accomplished.

Further, in the first embodiment, audio/video signals are 
recorded. However, only audio signals may be recorded, only
video signals may be recorded, or signals other than the audio/video
signals may be recorded. The MPEG coding method has been described 
as the compressive coding method for the video signals, but the 
video signals can be compressively coded by another compressive 
coding method. Examples of another compressive coding method are 
a wavelet coding method, a vector quantization coding method and 
the like. 

Further, in the first embodiment, the VOB division is 
performed on the basis of the VOB data size. However, the VOB 
division may be performed on the basis of the compressive coding 
rate and the VOB playback time. This is because the VOB size can 
be obtained by multiplying the coding rate by the playback time. 
In this case, when the coding rate is lower, the playback time 
of a VOB is lengthened and when the coding rate is higher, the 
playback time of a VOB is shortened, and the VOB division timing 
is decided on the basis of the VOB playback time. 
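
The relation between the coding rate and the VOB playback time may be illustrated by the following sketch. It assumes that the threshold M is given in bytes and the coding rate in bits per second. The function name is hypothetical.

```python
# Illustrative sketch only; the function name is hypothetical.
# Assumed units: threshold in bytes, coding rate in bits per second.

def vob_playback_time_limit(threshold_bytes, coding_rate_bps):
    """Playback time in seconds at which a VOB reaches the size threshold M.

    Because VOB size = coding rate x playback time, the time limit is
    the threshold divided by the coding rate.
    """
    return threshold_bytes * 8 / coding_rate_bps
```

Halving the coding rate doubles the returned playback time, which corresponds to the lengthened VOB playback time at a lower coding rate described above.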

The recording medium may be any medium such as an optical
disk or a magnetic disk. Further, the recording medium can be
a rewritable recording medium, or may be a recording medium that
is capable of recording only once.

In the first embodiment, the structure of the recording
apparatus has been described with reference to the block diagrams
of figures 1, 2 and 3, but the operations of the respective
constituents of the recording apparatus may be realized by software.
In this case, the respective constituents are realized by software 
of an arithmetic processor. The program is stored in a 
semiconductor memory, or a recording medium such as an optical 
disk or a magnetic disk.

ABSTRACT OF THE DISCLOSURE
A recording apparatus capable of recording audio/video signals in
real time in a recording format facilitating high-speed search. The
recording apparatus includes a compressed stream (VOB) generated by an
audio/video encoder 101 that is stored in a recording buffer memory
102. When data of one VOB is stored in the recording buffer memory
102, a navigation pack is generated from attribute information
of VOBUs constituting the VOB, the generated navigation pack
is inserted at the head of each VOBU, and then the data is recorded
onto a recording medium 106.



